Optimized Clustering Techniques with Special Focus to Biomedical Datasets

نویسنده

  • Anusuya S. Venkatesan
چکیده

The clinical data including clinical test results, MRI images and drug responses of patients are documented and analyzed with machine learning and data mining tools. The scale and complexity of these datasets is a big challenge to machine learning and data mining community as the data is of mixed type. The extraction of meaningful or desired information from these datasets provides knowledge in decision making process which in turn helps for the diagnosis and treatment of the diseases. Biomedical datasets are a collection of data with diverse types as it involves images, clinical studies, statistical reports etc. The recent researches have focused on different clustering and classification methods to manage and analyze the biomedical datasets. The objective of this chapter is to cluster or classify the patterns of interest from Brain MRI images, Liver disorder and Breast cancer datasets using efficient clustering methodologies. Among the different algorithms in data mining for clustering, classification, visualization and interpretation, K Means, Fuzzy C Means and Neural Networks(NN) are frequently used for clustering and classification of biomedical datasets. The performance of these methods are greatly influenced by the initialization of K value and its convergence speed. This chapter discusses about FCM and K Means clustering methods and its optimization with meta heuristics such as Particle Swarm Optimization (PSO) and Quantum Particle Swarm Optimization (QPSO). The experimental section of this paper exhibits analysis in terms of Intra cluster distances, elapsed time and Davis Bouldin Index (DBI).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

Repeated Record Ordering for Constrained Size Clustering

One of the main techniques used in data mining is data clustering, which has many applications in computer science, biology, and social sciences. Constrained clustering is a type of clustering in which side information provided by the user is incorporated into current clustering algorithms. One of the well researched constrained clustering algorithms is called microaggregation. In a microaggreg...

متن کامل

A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach

In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...

متن کامل

An Optimization K-Modes Clustering Algorithm with Elephant Herding Optimization Algorithm for Crime Clustering

The detection and prevention of crime, in the past few decades, required several years of research and analysis. However, today, thanks to smart systems based on data mining techniques, it is possible to detect and prevent crime in a considerably less time. Classification and clustering-based smart techniques can classify and cluster the crime-related samples. The most important factor in the c...

متن کامل

Clustering of Fuzzy Data Sets Based on Particle Swarm Optimization With Fuzzy Cluster Centers

In current study, a particle swarm clustering method is suggested for clustering triangular fuzzy data. This clustering method can find fuzzy cluster centers in the proposed method, where fuzzy cluster centers contain more points from the corresponding cluster, the higher clustering accuracy. Also, triangular fuzzy numbers are utilized to demonstrate uncertain data. To compare triangular fuzzy ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017